February 2008 
LPTHE-07-02 
arXiv:0802.1188 



The Catchment Area of Jets 



Matteo Cacciari, Gavin P. Salam 
LPTHE 
UPMC Universite Paris 6, 
Universite Paris Diderot - Paris 7, 
CNRS UMR 7589, Paris, France 

Gregory Soyez 

Brookhaven National Laboratory, Upton, NY 11973, USA 



Abstract 

The area of a jet is a measure of its susceptibility to radiation, like pileup or underlying 
event (UE), that on average, in the jet's neighbourhood, is uniform in rapidity and azimuth. 
In this article we establish a theoretical grounding for the discussion of jet areas, introducing 
two main definitions, passive and active areas, which respectively characterise the sensitivity to 
pointlikc or diffuse pileup and UE radiation. We investigate the properties of jet areas for three 
standard jet algorithms, k t , Cambridge/ Aachen and SISCone. Passive areas for single-particle 
jets are equal to the naive geometrical expectation irR 2 , but acquire an anomalous dimension 
at higher orders in the coupling, calculated here at leading order. The more physically relevant 
active areas differ from ttR 2 even for single-particle jets, substantially so in the case of the cone 
algorithms like SISCone with a Tevatron Run-II split-merge procedure. We compare our results 
with direct measures of areas in parton-shower Monte Carlo simulations and find good agreement 
with the main features of the analytical predictions. We furthermore justify the use of jet areas 
to subtract the contamination from pileup. 
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1 Introduction 

For nearly three decades now, jets [1] have represented the principal tool for accessing information 
about an event's partonic hard-scattering structure and kinematics. As a result of this, much 
work has been carried out on understanding the properties of jets in a range of collider contexts, 
addressing issues such as jet substructure [21 El S] , the correlations between multi-jet production 
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and the hard colour-structures present in an event and perturbative threshold corrections to 

jet production [7]. 

One issue that has been largely neglected, but that is highly relevant in a hadron collider context 
such as Tevatron or LHC, is that of the modification of jet kinematics by non-perturbative effects 
associated with the proton beams. These effects, often referred to as a whole as the "underlying 
event" (UE), are rather poorly understood. However, from tuned underlying event models [8, 9j one 
consistently finds that a principal effect of the underlying event is to add a rather large amount of 
transverse momentum, fairly uniformly throughout the event — according to models 3 — 5 GeV per 
unit rapidity at the Tevatron, 10 — 15 GeV per unit rapidity at LHC (see e.g. |10j). with substantial 
fluctuations in the amount from one event to another. This is an order of magnitude larger than the 
normal scale for non-perturbative effects in an e + e~ or DIS context, which amount to ~ 0.5 GeV 
per unit rapidity (with respect to the qq axis) I12j . Thus for jets with transverse momenta, 
Pt, of several tens of GeV, the UE can be as significant as the perturbative corrections to the jet 
transverse momentum a s pt- A related issue is that of pileup (PU), minimum-bias collisions that 
occur in the same bunch crossing as the main event, with the potential of further adding up to 
100 GeV of soft radiation per unit rapidity. 

Given the large momentum scales associated with the underlying event and pileup, it is impor- 
tant to develop tools for understanding how they both affect jets. Of the two, PU is conceptually 
simpler because the particles it adds are entirely uncorrelated with those of the hard scatter. In 
contrast UE particles cannot entirely be disentangled from those due to the hard scatter (neither 
theoretically, nor in practice). Nevertheless it is useful, and probably not too poor an approxima- 
tion, to treat the UE as largely independent from the hard scatter, just like PU. 

The UE and PU can affect a jet in two ways. Firstly, particles from the UE and PU may be 
clustered into the jet, increasing its transverse momentum. Secondly, the presence of the UE/PU 
particles can modify the way in which the particles from the hard scatter get clustered into jets. 

To study the first of these effects, we shall introduce the concept of the 'jet area'. The logic that 
motivates this particular quantity is that UE /PU particles are distributed uniformly in rapidity and 
azimuth, at least after averaging over many events. Therefore a good measure of a jet's susceptibility 
to contamination by the UE and PU should be given by the extent of the region (on the rapidity- 
azimuth cylinder) over which it is liable to capture UE and pileup particles, i.e. the jet's "catchment 
area" . 

One might naively think of this area as the area of the surface covering all the particles that 
make up the jet. However, a moment's reflection shows that this area is actually zero, the jet 
being made of pointlike particles. An alternative obvious definition would be to take the area of 
the convex hull surrounding all the particles in a jet. However this definition also fails to satisfy 
basic sensibleness requirements: for example, if two jets share an irregular boundary it is possible 
that their convex hulls will overlap and the assignment of area to one or other of the jets becomes 
ambiguous. 

Two main definitions of the jet area can be introduced without such shortcomings. A first one, 
the passive area, involves scanning a single infinitely soft particle (a "ghost") over the rapidity- 
azimuth cylinder and determining the region over which it is clustered within a given jet. It can 
be understood as a measure of the susceptibility of the jet to contamination from an UE with 
pointlike structure. It is discussed in detail in section [2j A second definition, studied in section O 
the active area, involves adding a dense coverage of infinitely soft "ghosts" and counting how many 
are clustered inside a given jet. It can be understood as a measure of the susceptibility of the jet 
to contamination from an UE with uniform, diffuse structure. The passive and active areas differ 
because in the latter case the ghosts can cluster among themselves as well as with the actual event 
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particles, thus playing a more active role in the clustering. We will find that the numerical value of 
the jet area can differ according to the kind of particles which make up the jet. The simplest case 
to study will be that of jets with a single hard particle. We shall then consider how the jet's area 
changes when its main hard parton emits an additional soft gluon, which effectively causes the jet 
area to acquire an anomalous dimension. We will also study "pure ghost jets," those exclusively 
made up of ghost particles. 

In section H] we shall then consider how the presence of UE/PU affects the clustering into jets of 
particles from the hard scatter, an effect that we refer to as the back-reaction of the UE/PU on the 
jet structure. Again we shall examine this in two limits: pointlike and perfectly diffuse UE/PU. 

As well as making approximations for the structure of the UE/PU it will also be necessary to 
make simplifying approximations for the jet structure. Thus we will work in an energy ordered 
limit, in which the perturbative event consists of one very hard particle, plus an optional additional 
soft perturbative particle. This will be equivalent to making a leading (single) logarithmic approx- 
imation, truncated at its first non-trivial order in a s . For some purposes we shall also assume a 
small jet-radius parameter R (i.e. the small-cone approximation of |13j). 

Despite these many simplifications, the calculations that we present will be seen to provide con- 
siderable insight into the mechanisms at play in jet clustering in events with UE/PU. In particular 
they will help highlight characteristic analytical structures that are common to a range of rather 
different jet algorithms, as well as significant jet-algorithm-dependent differences in the quantita- 
tive impact of these analytical structures. Of the algorithms that we will consider, two are based 
on sequential recombination (the inclusive kt [H] and Cambridge/Aachen [15] algorithms), while 
the third is a modern, infrared-safe stable-cone algorithm (SISCone [16J) with a Tevatron Run-II 
type split-merge procedure to resolve overlapping stable cones. A brief description of the three jet 
algorithms is given in Appendix [Al 

To help reinforce the connection between our analytical results and the (simulated) real world, 
we shall close the article in section [5] with Herwig |17j and Pythia [18] Monte Carlo studies. We 
shall also give the foundations of the use of the area concept for performing PU subtractions [19J. 

2 Passive Area 

Suppose we have an event composed of a set of particles {pi} which are clustered into a set of jets 
{ Ji} by some infrared-safe jet algorithm. 

Imagine then adding to the {pi} a single infinitely soft ghost particle g at rapidity y and azimuth 
<f>, and repeating the jet-finding. As long as the jet algorithm is infrared safe, the set of jets {Ji} 
is not changed by this operation: their kinematics and hard particle content will remain the same, 
the only possible differences being the presence of g in one of the jets or the appearance of a new 
jet containing only g. 

The passive area of the jet J can then either be defined as a scalar 

a{J) = J dyd4f{j3M),J) f(9,J) = ll H j > W 

which corresponds to the area of the region where g is clustered with J, or as a 4-vector, 

a,{J) = Jdyd<P fMv, 0, -0 M, J) = { ^ \\ j > (2) 

where gt is the ghost transverse momentum. For a jet with a small area a{J) <C 1, the 4-vector area 
has the properties that its transverse component satisfies at (J) = a(J), it is roughly massless and 



4 



it points in the direction of J. For larger jets, a(J) ~ 1, the 4- vector area acquires a mass and may 
not point in the same direction as J. We shall restrict our attention here to scalar areas because 
of their greater simplicity. Nearly all results for scalar areas carry over to 4- vector areas, modulo 
corrections suppressed by powers of the jet radius (usually accompanied by a small coefficient)^ 

2.1 Areas for 1-particle configurations 

Consider this definition in the context of three different jet algorithms: inclusive kt, inclusive 
Cambridge/Aachen, and the seedless infrared-safe cone algorithm, SISCone. The definitions of all 
three algorithms are summarised in appendix lAl 

Each of these jet algorithms contains a parameter R which essentially controls the reach of the 
jet algorithm in y and 4>. Given an event made of a single particle pi, the passive area of the jet J\ 
containing it is a{J\) = ttR 2 for all three algorithms. 

2.2 Areas for 2-particle configurations 

Let us now consider what happens to the passive jet areas in the presence of additional soft per- 
turbative radiation. We add a particle p 2 such that the transverse momenta are strongly ordered, 

Pti » Pt2 » Aqcd » 9t , (4) 

and pi and p2 are separated by a geometrical distance A12 = (yi — 1/2) 2 + (<fii — (P2) 2 in the y-(j) plane. 
Subsequently we shall integrate over A12 and pt 2 - Note that gt has been taken to be infinitesimal 
compared to all physical scales to ensure that the presence of the ghost particle does not affect the 
real jets. 

For A12 = collinear safety ensures that the passive area is still equal to ttR 2 for all three 
algorithms. However, as one increases A12, each algorithm behaves differently. 

2.2.1 k t 

Let us first consider the behaviour of the kt jet algorithm, a sequential recombination algorithm, 
which has 2-particle distance measure dij = min(fe^, k 2 j)A 2 j/R 2 and beam-particle distance diB = 
k\ [H]. Taking A12 ~ A\ g ~ A2 g ~ R and exploiting the strong ordering one has 

diB > d 2 B ~ dn » d g i ~ d g2 ~ d gB ■ (5) 

From this ordering of the kt distances, one sees that the ghost always undergoes its clustering before 
either of the perturbative particles p\ and p 2 - Specifically, if at least one of Ai 9 and A 2g is smaller 
than R, the ghost clusters with the particle that is geometrically closer. 

If both Ai g and A 2g are greater than R the ghost clusters with the beam and will not belong 
to any of the perturbative jets. 

1 The above definitions apply to jet algorithms in which each gluon is assigned at most to one jet. For a more 
general jet algorithm (such as the "Optimal" jet finder of [20] or those which perform 3 — > 2 recombination like 
ARCLUS [H]), then one may define the 4- vector area as 

a M (J)= lim - f dydcfr (JM Pi },g(y,c/>)) - ^({pi})) , (3) 

where J/j({pi}, g(y, 4 1 )) is the 4-momentum of the jet as found on the full set of event particles {pi} plus the ghost, 
while J(i({pi}) is the jet-momentum as found just on the event particles. 



5 




Cam/Aachen 



d) 




h) 



SISCone 



< A 12 < R/2 





f) 





R/2 < A 12 < R 



R < A 12 < 2R 



Figure 1: Schematic representation of the passive area of a jet containing one hard particle "1" 
and a softer one "2" for various separations between them and different jet algorithms. Different 
shadings represent distinct jets. 



Let us now consider various cases. If A12 < R, (fig. [TK,b) the particles p\ and P2 will eventually 
end up in the same jet. The ghost will therefore belong to the jet irrespectively of having been 
clustered first with p\ or p2- The area of the jet will then be given by union of two circles of radius 
R, one centred on each of the two perturbative particles, 
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for A12 < R , 



where 



u(x) = — 



7T 



./' \ 1 — ^j- + 2 (tt — arccos 



(6) 



(7) 



represents the area, divided by 7r, of the union of two circles of radius one whose centres are separated 
by x. 

The next case we consider is R < A12 < 2R, fig. [lb. In this case p\ and P2 will never be able to 
cluster together. Hence, they form different jets, and the ghost will belong to the jet of the closer 
of pi or p2- The two jets will each have area 



u(A 12 /R) 2 
afci,_R(Ai2j = irn , 



for R < Ai 2 < 2R. 



(8) 



Finally, for A12 > 2R the two jets formed by p\ and P2 each have area nR 2 . 

The three cases derived above are summarised in table [T] and illustrated in fig. [2j 
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2.2.2 Cambridge/ Aachen 

Now we consider the behaviour of the Cambridge/ Aachen jet algorithm [15], also a sequential 
recombination algorithm, defined by the 2-particle distance measure dij = A?/i? 2 and beam- 
particle distance diB = 1. Because in this case the distance measure does not involve the transverse 
momentum of the particles, the ghost only clusters first if min(Ai ff , £±2g) < A 12. Otherwise, p\ and 
P2 cluster first into the jet J, and then J captures the ghost if Aj g ~ A\ g < R. 

The region A 12 < R now needs to be separated into two parts, A 12 < R/2 and R/2 < A 12 < R. 

If the ghost clusters first, then it must have been contained in either of the dashed circles 
depicted in figs. [TU,e. If A12 < R/2 both these circles are contained in a circle of radius R centred 
on pi (fig. [TH), and so the jet area is ttR 2 . 

If R/2 < A 12 < R (fig. QJ) the circle of radius A 12 centred on p 2 protrudes, and adds to the 
area of the final jet, so that 

a C am,i ? (A 12 ) = w f ^ ) ttR 2 , for R/2 <A 12 <R, (9) 

where 

w(x) = — 

IT 

For A12 > R a Cambridge/ Aachen jet has the same area as the kt jet, cf. fig. QJ. As with the kt 
algorithm, the above results are summarised in tabled] and fig. [2] The latter in particular illustrates 
the significant difference between the kt and Cambridge/Aachen areas for Ai 2 ~ R/2, caused by 
the different order of recombinations in the two algorithms. 

2.2.3 SISCone 

Modern cone jet algorithms identify stable cones and then apply a split/merge procedure to over- 
lapping stable cones. The arguments that follow are identical for midpoint and seedless cone jet 
algorithms with a Tevatron run II type split-merge procedure [22J. For higher orders, or more 
realistic events, it is mandatory to use an infrared-safe seedless variant, a main example of which 
is SISCone pi]. 

For A12 < R only a single stable cone is found, centred on p±. Any ghost within distance R of 
pi will therefore belong to this jet, so its area will be irR 2 , cf. fig. [Qi,g. 

For R < A12 < 2R two stable cones are found, centred on p\ and p%. The split/merge procedure 
will then always split them, because the fraction of overlapping energy is zero. Any ghost falling 
within either of the two cones will be assigned to the closer of p\ and p 2 (see fig. [T})- The area of 
the hard jet will therefore be the same as for the kt and Cambridge algorithms. 

Again these results are summarised in table [T] and fig. [2] and one notices the large differences 
relative to the other algorithms at R < 1 and the striking feature that the cone algorithm only ever 
has negative corrections to the passive area for these energy-ordered two-particle configurations. 

2.3 Area scaling violation 

Given that jet passive areas are modified by the presence of a soft particle in the neighbourhood 
of a jet, one expects the average jet area to acquire a logarithmic dependence on the jet transverse 
momentum when the jets acquire a sub-structure as a consequence of radiative emission of gluons.To 
determine its coefficient we shall work in the approximation of small jet radii, motivated by the 
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a JAtR (A 12 )/TrR' 2 
Cambridge /Aachen 


SISCone 


< A12 < R/2 
R/2 <A 12 <R 
R<A 12 < 2R 
A12 > 2R 


u(A 12 /R) 
u{A 12 /R) 
u(A 12 /R) / 2 
1 


1 

w(A 12 /R) 
u(A 12 /R) / 2 
1 


1 
1 

u{A 12 /R) / 2 
1 



Table 1: Summary of the passive areas for the three jet algorithms for the hard jet in an event 
containing one hard and one soft particle, separated by a y-cj> distance A\ 2 . The functions u and w 
are defined in eqs. (|7j) and ([TO]) . 
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Figure 2: Plot of the passive areas of the hard jet as a function of the distance between the hard 
and the soft particle, as given in table [JJ 



observation that corrections for finite R, proportional to powers of R, are usually accompanied by 
a small coefficient [I3j [10] . 

In the small-angle limit, the QCD matrix element for the emission of the perturbative soft gluon, 
P2, is 

dP n 2a s ( Pt2 A 12 ) 1 1 

dp t2 dA 12 7T A 12 p t 2 

where C\ is Cp or Ca according to whether the hard particle p\ is a quark or a gluon. We make 
use of the fact that A12 is just the angle between the two particles (to within a longitudinal boost- 
dependent factor), and will assume that R is sufficiently small that the small-angle approximation 
is legitimate. The scale of the coupling is taken to be the transverse momentum of p 2 relative to 
Pi- 

At order a s the mean jet area with a given jet algorithm (JA) can be written 

<aj A ,fl) = «JA,i?(0) + {Aa JAiR ) = irR 2 + {Aa JAiR ) , (12) 
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where we explicitly isolate the O (a s ) higher-order contribution, 

f2R rpti 

(AajA, fl )~/ dA 12 dp t2 - — — (ajA,fl(Ai 2 ) - a JA) fl(0)) , (13) 

Jo JQ0/A12 d p t2 aA 12 

with the — ajA,i?(0) term accounting for virtual corrections. (• • •) represents therefore an average 
over perturbative emission. Note that because of the \jpt2 soft divergence for the emission of pt2, 
eq. (|13p diverges unless one specifies a lower limit on pt2 — accordingly we have to introduce an 
infrared cutoff Q0/A12 on the pt of the emitted gluon. This value results from requiring that the 
transverse momentum of p 2 relative to pi, i.e. pt2^-i2, be larger than Qq. The fact that we need to 
place a lower limit on p^ 2 means that jet areas are infrared unsaf^ - they cannot be calculated 
order by order in perturbative QCD and for real-life jets they will depend on the details of non- 

As concerns the finiteness of the A 12 integration, all the jet algorithms we consider have the 
property that 



perturbative effects (hadronisation). One can account for this to some extent by leaving Qq as a 
free parameter and examining the dependence of the perturbative result on Qq\ 



lim aj AlJ R(Ai2) = a JA ,R(0) = irR 2 ; (14a) 

Ai2^0 

aj A ,i?(Ai 2 ) = nR 2 for A 12 > 2R (14b) 

so that the integral converges for A12 — ► 0, and we can place the upper limit at A12 = 2R- 

After evaluating eq. (I13p . with the replacement A12 — * R, both in the lower limit of the pt2 
integral and the argument of the coupling, we obtain 

/A \ A 2a * Cl 1 Rptl A ( 2Rd9 < <a\ ^ tin 

(AajA,R) = dj A ,R In——, dj A ,R= — (a J A, J R(6') - vri? ) , (15) 

■k Qo Jo V 

in a fixed coupling approximation, and 

(AajA r) = dj A ,R^r ln aS }v°\ ' ( 16 ) 
vrbo a s {Rpti) 

with a one-loop running coupling, where 60 = ~~^r~^- The approximation A12 ~ R in the 
argument of the running coupling in the integrand corresponds to neglecting terms of O (a s ) without 
any enhancements from logarithms of R or pti/Qo- 
The results for djA,R are 

d kt ,R = (jy + I + ^) R2 ~ °' 5638 nR2 ' ^ 17a ) 

d C ,m,R = + I " 2 ?) R2 ~ °- 07918 ^ R2 ' ( 17b ) 

dsiSConefi = (~ ^ + \ ~ ^) ^ ~ "O- 06378 ^ > ( 17c ) 



2 An exception is for the SISCone algorithm with a cut, pt,min, on the minimum transverse momentum of protojets 
that enter the split-merge procedure procedure — in that situation Qo is effectively replaced by pt,min- 

3 One should of course bear in mind that the multi-particle structure of the hadron level is such that a single-gluon 
result cannot contain all the relevant physics — this implies that one should, in future work, examine multi-soft gluon 
radiation as well, perhaps along the lines of the calculation of non-global logarithms 30, 3TJE1E], though it is not 
currently clear how most meaningfully to carry out the matching with the non-perturbative regime. Despite these 
various issues, we shall see in section [5] that the single-gluon results work remarkably well in comparisons to Monte 
Carlo predictions. In that section, we shall also argue that in cases with pileup, the pileup introduces a natural 
semi-hard (i.e. perturbative) cutoff scale that replaces Qo- 
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where 

, ^'(1/6) +^'(1/3) -^'(2/3) -^(5/6) , s 

f = w 7 — 7 — !. v 7 ; r w ; ~ 0.507471 , (18) 

48^3 V ; 

with ip'(x) = d 2 1 dx 2 (lnT (x)) . One notes that the coefficient for the kt algorithm is non-negligible, 
given that it is multiplied by the quantity 2a s Ci/irlnRp t i/Qo in eq. (fl~5j) (or its running coupling 
analogue), which can be of order 1. In contrast the coefficients for Cambridge/Aachen and SISCone 
are much smaller and similar (the latter being however of opposite sign). Thus kt areas will depend 
significantly more on the jet pt than will those for the other algorithms. 

The fluctuation of the areas can be calculated in a similar way. Let us define 

(<rjA, R ) = <Oja,ji> " («JA,fl) 2 = ^a,r(°) + ( A ^a,r) , <7jA,fl(0) = , (19) 

where we have introduced cr 2 AR (0), despite its being null, so as to facilitate comparison with later 
results. We then evaluate 

(Aa 2 A)R ) = (Aa 2 JAiR ) - <Aa JA ,K> 2 * (Aa 2 AjR > , (20) 

where we neglect (Aoja,_r) 2 since it is of O (a 2 ln 2 (Rp t i/Qo)) . The calculation of (Aa 2 A R ) proceeds 
much as for (AajA,fl) and gives 

/A 2 \ 2 Cl , a s(Qo) 2 f 2R d6 2 2 

( Aa JA,R) = s JA,R~r- In —p= — r , s JA R = / -7-(aj A ,jj(e) - ttR ) (21) 
7r6 a s (Rpti) Jo v 

for running coupling. The results are 

4 uR = - H _ *m + 2 <) # * (0.4499 ,R 2 ) 2 , (22a) 

sh^R = " S " T " ^ + f * (O- 2 ^^ 2 ) 2 , (22b) 

_/Vfcr 15 vr 2 13C(3) 2vr A D , Wnnm ^_ D2 ,2 



%SCone,i? = - ei - is ~ ^4 + ~ (°- 091427rii ) • (22c) 

We have a hierarchy between algorithms that is similar to that observed for the average area scaling 
violations, though now the coefficient for Cambridge/Aachen is more intermediate between the other 
two. 



2.4 n-particle properties and the Voronoi area 

The only algorithm for which one can make any statement about the passive area for a general 
n-particle configuration is the kt algorithm. 

Because of the kt distance measure, the single ghost will cluster with one of the event particles 
before any other clustering takes place. One can determine the region in which the ghost will 
cluster with a given particle, and this is a definition of the area ak t ,R(pi) of a particle pj. Since the 
ghost-particle clustering will occur before any particle-particle clustering, the jet area will be the 
sum of the areas of all its constituent particles: 

ak t ,R( J ) = a ktAPi) • (23) 
Pie J 
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Figure 3: The passive area of jets in a parton-level event generated by Herwig and clustered with 
the kt algorithm with R = 1. The towers represent calorimeter cells containing the particles, the 
straight (green) lines are the edges of the Voronoi cells and the shaded regions are the areas of the 
jets. 

Can anything be said about the area of a particle? The ghost will cluster with the event particle 
to which it is closest, as long it is within a distance R. There exists a geometrical construction 
known as the Voronoi diagram, which subdivides the plane with a set of vertices into cells around 
each vertices^ Each cell has the property that all points in the cell have as their closest vertex the 
cell's vertex. Thus the Voronoi cell is remarkably similar to the region in which a ghost will cluster 
with a particle. The only difference arises because of the limitation that the ghost should be within 
a distance R of the particle — this causes the area of particle i to be the area of its Voronoi cell V% 
intersected with a circle of radius R, Ci^R, centred on the particle. This leads us to define a Voronoi 
area for a particle, aj^pj), 

a^p^areaO^nC^). (24) 

Thus given a set of momenta, the passive area of a kt jet can be directly determined from the 
Voronoi diagram of the event |£| using eq. (j23[) and the relation 

aktAPi) = a R(Pi) ■ ( 25 ) 

It is quite straightforward to see that this result holds for the 2-particle case, because the Voronoi 
diagram there consists of a single line, equidistant between the two points. It divides the plane 
into two half-planes, each of which is the Voronoi cell of one of the particles (this is best seen in 
fig. [lb). The intersection of the halfplane with the circle of radius R centred on the particle has 
area |7ri? 2 u(Ai2/ R), and this immediately gives us the results eqs. Q, ([8]) according to whether 
the particles cluster into a single jet or not. 

4 It is this same geometrical construction that was used to obtain a nearest neighbour graph that allowed k t jet 
clustering to be carried out in NlnN time [23] . 

5 Strictly speaking it should be the Voronoi diagram on a y — <f> cylinder, however this is just a technical detail. 
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The Voronoi construction of the /^-algorithm passive area is illustrated for a more complex event 
in fig. [3[ One sees both the Voronoi cells and how their intersection with circles of radius R = 1 
gives the area of the particles making up those jets. 

Note that it is not possible to write passive areas for jet algorithms other than kt in the form 
eq. (|23p . One can however introduce a new type of area for a generic algorithm, a Voronoi area, in 
the form 

Pi6J 

While for algorithms other than kt (for which, as we have seen, ak ti n(J) = a% r{J)), this area is not 
in general related to the clustering of any specific kind of background radiation, it can nevertheless 
be a useful tool, because its numerical evaluation is efficient |24[ [23] and as we shall discuss later 
(sec. I3.4p . for dense events its value coincides with both passive and active area definitions. 



3 Active Area 

To define an active area, as for the passive area, we start with an event composed of a set of particles 
{pi} which are clustered into a set of jets {Jj} by some infrared-safe jet algorithm. However, instead 
of adding a single soft ghost particle, we now add a dense coverage of ghost particles, {gi}, randomly 
distributed in rapidity and azimuth, and each with an infinitesimal transverse momentumjfl The 
clustering is then repeated including the set of particles plus ghosts. 

During the clustering the ghosts may cluster both with each other and with the hard particles. 
This more 'active' participation in the clustering is the origin of the name that we give to the area 
to be defined shortly. It contrasts with the definition of section [2j in which the single ghost acted 
more as a passive spectator, and in particular could not cluster with any other ghosts (there weren't 
any). 

Because of the infrared safety of any proper jet algorithm, even the addition of many ghosts 
does not change the momenta of the final jets {J?,}. However these jets do contain extra particles, 
ghosts, and we use the number of ghosts in a jet as a measure of its area. Specifically, if the number 
of ghosts per unit area (on the rapidity-azimuth cylinder) is v g and M g (J) is the number of ghosts 
contained in jet J, then the (scalar) active area of a jet, given the specific set of ghosts {gi\ is 

A(J\{ 9i }) = ^-. (27) 

V 9 

An example of jet areas obtained in this way is shown in figure [H One notes that the boundaries 
of the jets are rather ragged. Clustering with a different set of ghosts would lead to different 
boundaries. This is because the ghosts can cluster among themselves to form macroscopic subjets, 
whose outlines inevitably depend on the specific set of initial ghosts0 and these then subsequently 
cluster with true event particles. This can happen for any density of ghosts, and thus the jet 
boundaries tend, for most algorithms, to be sensitive to the randomness of the initial sets of ghosts. 

This randomness propagates through to the number of ghosts clustered within a given jet, even 
in the limit v g — > oo, resulting in a different area each time. To obtain a unique answer for active 

6 In most cases the distribution of those transverse momenta will be irrelevant, at least in the limit in which the 
density of ghosts is sufficiently high. 

7 This is a form of dynamic magnification of the microscopic local breaking of translational invariance introduced 
by the ghosts' randomness. 
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Figure 4: Active area for the same event as in figure [3j once again clustered with the k t algorithm 
and R = 1. Only the areas of the hard jets have been shaded — the pure 'ghost' jets are not shown. 

area of a given jet one must therefore average over many sets of ghosts, in addition to taking the 
limit of infinite ghost densityH 

A(J)= lim (A(J\{ 9i })) . (28) 

Note that as one takes v s —* do, the ghost transverse momentum density, v g {gt)i is to be kept 
infinitesimal. 

The active area should bear a close resemblance to the average susceptibility of the jet to a 
high density of soft radiation {e.g. minimum-bias pileup), since the many soft particles will cluster 
between each other and into jets much in the same way as will the ghosts. 

One may also define the standard deviation £( J) of the distribution of a jet's active area across 
many ghost ensembles, 

£ 2 (J) = lim {A\j\{ 9l })) ~A\J). (29) 

This provides a measure of the variability of a given jet's contamination from (say) pileup and is 
closely connected with the momentum resolution that can be obtained with a given jet algorithm. 

A feature that arises when adding many ghosts to an event is that some of the final jets contain 
nothing but ghost particles. They did not appear in the original list of { Jj} and we refer to them 
as pure ghost jets. These 'ghost' jets (not shown in fig. S]), fill all of the 'empty' area, at least in 
jet algorithms for which all particles are clustered into jets. They will be similar to the jets formed 
from purely soft radiation in events with minimum-bias pileup, and so are interesting to study in 
their own right. 

8 One may wonder if the averaged area (and its dispersion) depends on the specific nature of the fluctuations in 
ghost positions and momenta across ensembles of ghosts — for a range of choices of these fluctuations, no significant 
difference has been observed (except in the case of pure ghost jets with SISCone, whose split-merge step introduces 
a strong dependence on the microscopic event structure). 
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We can also define a 4-vector version of the active area (in analogy with the 4-vector passive 
area). It is given by 

MJ I {&}) = -4-T E 9»i , MJ) = lim (A^J | {g t })) . (30) 

The sum of the is to be understood as carried out in the same recombination scheme as used in 
the jet clustering]; 



3.1 Areas for 1-particle configurations and for ghost jets 
3.1.1 k t and Cambridge/ Aachen 

The active area for the kt and Cambridge/ Aachen algorithms is most readily studied numerically, 
by directly clustering large numbers of ghost particles, possibly together with one or more hard 
particles. This is feasible because of the availability of fast computational methods for carrying 
out the clustering in these algorithms, implemented in the Fast Jet package |23j . Typically we add 
ghosts with a density v g of ~ 100 per unit area in the rapidity region \y\ < y m ax — 6, and study 
jets in the region \y\ < y ma x — R- This leads to about 7500 ghost particles, which can be clustered 
in about 0.1s on a 3.4 GHz processor. Each ghost is given a transverse momentum ~ 10~ 100 GeV 
and the one hard particle that we study has a transverse momentum of 100 GeV. The results 
are insensitive to the values chosen as long as their ratio is sufficiently large. We investigate in 
Appendix [B] how the distribution of the "1-hard-parton" jet area gets modified when the transverse 
momentum of the parton is progressively reduced below the scale of a generic set of soft particles. 

Figure [5] shows the distribution of values of A(J \ {gi}) for pure ghost jets and jets with one hard 
particle. The distribution is obtained over a large ensemble of sets of ghosts f^l Let us concentrate 
initially on the case with a hard particle. Firstly the average active area, eq. (|28p differs noticeably 
from the passive result of irR 2 : 

^4fc t ,_R(one-particle-jet) ~ 0.812 ttR 2 , (31a) 
A^m.Mone-particle-jet) ~ 0.814 irR 2 . (31b) 

Secondly, the distributions of the area in fig. [5] are rather broad. The randomness in the initial 
distribution of ghosts propagates all the way into the shape of the final jet and hence its area. 
This occurs because the kt and Cambridge/ Aachen algorithms flexibly adapt themselves to local 
structure (a good property when trying to reconstruct perturbative showering), and once a random 
perturbation has formed in the density of ghosts this seeds further growth of the soft part of the 
jet. The standard deviations of the resulting distributions are 

Sfe t) R (one-particle-jet) ~ 0.277irR 2 , (32a) 
Scam,R (one-particle-jet) ~ 0.261 irR 2 . (32b) 

9 Though we do not give the details it is simple to extend the 4-vector active area definition to hold also for a 
general IR safe jet algorithm, in analogy with the extension of the passive area definition in eq. Q. 

10 They are placed on a randomly scattered grid, in order to limit the impact of the finite density, i.e. one effectively 
carries out quasi Monte Carlo integration of the ghost ensembles, so that that finite density effects ought to vanish 
as i/g 3 ^ 4 , rather than Vg 1 ^ 2 as would be obtained with completely random placement. 

11 In this particular case we have used about 10 7 separate random ghost sets, in order to obtain a smooth curve for 
the whole distribution. When calculating areas in physical events (or even at parton-shower level) the multiple real 
particles in the jet "fix" most of the area, and between 1 and 5 sets of ghosts particles are usually sufficient to obtain 
reliable area results (this is the case also for SISCone). 
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Figure 5: Distribution of active areas for pure ghost jets and jets with a single hard particle: (a) k t 
algorithm, (b) Cambridge/ Aachen algorithm. 

Figure [5] also shows the distribution of areas for pure ghost jets. One sees that pure ghost jets 
typically have a smaller area than hard-particle jetsr 2 ! 



and the standard deviations are 



A kuR (ghost-jet) ~ 0.554 ttR 2 , 

^Cam^fehost-jet) ~ 0.551 ttR 2 , 

E fctijR (ghost-jet) ~ 0.174 ttR 2 , 

S C am,i?(ghost-jet) ~ 0.176^R 2 . 



(34a) 
(34b) 



(35a) 
(35b) 



The fact that pure ghost jets are smaller than hard jets has an implication for certain physics 
studies: one expects jets made of soft 'junk' (minimum bias, pileup, thermal noise in heavy ions) 
to have area properties similar to ghost jets; since they are smaller on average than true hard jets, 
the hard jets will emerge from the junk somewhat more clearly than if both had the same area. 

12 Obtaining these values actually requires going beyond the ghost density and the rapidity range previously men- 
tioned. In fact, when going to higher accuracy one notices the presence of small edge and finite-density effects, 
0(R/(y m ax — R)) and 0(1/ (v g R 2 )) to some given power. Choosing the ghost area sufficiently small to ensure that 
finite-density effects are limited to the fourth decimal (in practice this means l/(v g R 2 ) < 0.01) and extrapolating to 
infinite y m ax one finds 

A kt (ghost-jet) ~ (0.5535 ± 0.0005) Tri? 2 , (33a) 
Acam (ghost-jet) ~ (0.5505 ± 0.0005) ttR 2 , (33b) 

with a conservative estimate of the residual uncertainty. This points to a small but statistically significant difference 
between the two algorithms. 
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Cone stability Split-merge 




Figure 6: Left: the hard particle with the stable cone (H) centred on it, an example of a cone 
(A) that is unstable because it also contains the hard particle, and of two cones (B) and (C) that 
contain just ghost particles and are therefore stable. Right: some of the stable ghost cones (thin 
blue circles) that have the largest possible overlap with (H) , together with the boundary of the hard 
jet after the split-merge procedure (dashed green line). In both diagrams, the grey background 
represents the uniform coverage of ghosts. 



3.1.2 SISCone 

The SISCone algorithm is unique among the algorithms studied here in that its active area is 
amenable to analytical treatment, at least in some cases. 

We recall that a modern cone algorithm starts by finding all stable cones. One stable cone is 
centred on the single hard particle. Additionally, there will be a large number of other stable cones, 
of order of the number of ghost particles added to the event [16]. In the limit of an infinite number 
of ghosts, all cones that can be drawn in the rapidity-azimuth plane and that do not overlap with 
the hard particle will be stable. Many of these stable cones will still overlap with the cone centred 
on the hard particle, as long as they do not contain the hard particle itself (see figure El left). 

Next, the SISCone algorithm involves a split-merge procedure. One defines pt for a jet as the 
scalar sum of the transverse momenta of its constituents. During the split-merge step, SISCone 
finds the stable cone with the highest pt, and then the next hardest stable cone that overlaps with it. 
To decide whether these two cones (protojets) are to be merged or split, it determines the fraction 
of the softer cone's pt that is shared with the harder cone. If this fraction is smaller than some 
value / (the overlap parameter of the cone algorithm), the two protojets are split: particles that 
are shared between them are assigned to the protojet to whose centra 13 ! they are closer. Otherwise 
they are merged into a single protojet. This procedure is repeated until the hard protojet no longer 
has any overlap with other protojets. At this point it is called a jet, and the split-merge procedure 
continues on the remaining protojets (without affecting the area of the hard jet). 

The maximum possible overlap fraction^ / maX ! between the hard protojet and a ghost protojet 
occurs in the situation depicted in figured] (right), i.e. when the ghost protojet's centre is just outside 

13 This centre is given by the sum of momenta in the protojet before the split-merge operation. 
14 The fraction of momentum coincides with the fraction of area because the ghosts have uniform transverse mo- 
mentum density. 
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the edge of the original hard stable cone (H). It is given by / max = 2 — u(l) = | — 0.391. This 
means that for a split-merge parameter / > / max (commonly used values are / = 0.5 and / = 0.75) 
every overlap between the hard protojet and a pure-ghost stable cone will lead to a splitting. Since 
these pure-ghost stable cones are centred at distances d > R from the hard particle, these splittings 
will reduce the hard jet to a circle of radius R/2 (the dashed green line in the right hand part of 
figure [6]). The active area of the hard jet is thus 

ttR 2 

^sisCone,_R(one-particle-jet) = — — . (36) 

This result has been verified numerically using the same technique employed above for kt and 
Cambridge /Aachen. 

Note that this area differs considerably from the passive area, irR 2 . This shows that the cone 
area is very sensitive to the structure of the event, and it certainly does not always coincide with 
the naive geometrical expectation ttR 2 , contrary to assumptions sometimes made in the literature 
(see for example [25J. 

We further note that in contrast to kt and Cambridge/Aachen algorithms, the SISCone algorithm 
always has the same active area for a single hard particle, independently of fluctuations of an 
infinitely dense set of ghosts, i.e. 

£sisCone,R(one-particle-jet) = 0. (37) 



SISCone ghost-jet areas. While the area of a hard particle jet could be treated analytically, 
this is not the case for the pure-ghost jet area. Furthermore, numerical investigations reveal that 
the pure-ghost area distribution has a much more complicated behaviour than for kt or Cam- 
bridge/Aachen. One aspect is that the distribution of pure-ghost jet areas is sensitive to the fine 
details of how the ghosts are distributed (density and transverse momentum fluctuations). Another 
is that it depends significantly on the details of the split-merge procedure. Figure [7^ shows the dis- 
tribution of areas of ghost jets for SISCone, for different values of the split-merge overlap threshold 
/. One sees, for example, that for smaller values of / there are occasional rather large ghost jets, 
whereas for / > 0.6 nearly all ghost jets have very small areas. 

One of the characteristics of SISCone that differs from previous cone algorithms is the specific 
ordering and comparison variable used to determine splitting and merging. As explained above, 
the choice made in SISCone was pt, the scalar sum of transverse momenta of all particles in a jet. 
Previous cone algorithms used either the vector sum of constituent transverse momenta, pt (an 
infrared unsafe choice) , or the transverse energy Et = E sin 9 (in a 4- vector recombination scheme) . 
With both of these choices of variable, split-merge thresholds / < 0.55 can lead to the formation of 
'monster' ghost jets, which can even wrap around the whole cylindrical phase space. For / = 0.45 
this is a quite frequent occurrence, as illustrated in figure [7b, where one sees a substantial number 
of jets occupying the whole of the phase space (i.e. clll £Lr6cl 47Tt/ max ~ 2AttR 2 ). Monster jets can 
be formed also with the pt choice, though it is a somewhat rarer occurrence — happening in 'only' 
~ 5% of events 

We have observed the formation of such monster jets also from normal pileup momenta simulated 
with Pythia [18], indicating that this disturbing characteristic is not merely an artefact related to 
our particular choice of ghosts. This indicates that a proper choice of the split-merge variable and 

15 This figure is not immediately deducible from fig.03, which shows results normalised to the total number of ghost 
jets, rather than to the number of events. 
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Figure 7: Distribution of pure-ghost jet areas for the SISCone algorithm, (a) with different values of 
the split-merge parameter / and (b) different choices for the scale used in the split-merge procedure. 
Ghosts are placed on a grid up to \y\ < 6, with an average spacing of 0.2 x 0.2 in y, (j), and a random 
displacement of up to ±0.1 in each direction, and transverse momentum values that are uniform 
modulo a ±5% random rescaling for each ghost. We consider all jets with y < 5. All jet definitions 
use R = 1 and multiple passes. 



threshold is critical in high-luminosity environments. The results from Figure [7^,, suggest that if 
one wants to avoid monster jets, one has to choose a large enough value for /. Our recommendation 
is to adopt / = 0.75 as a default value for the split-merge threshold (together with the use of the 
Pt variable, already the default in SISCone, for reasons related to infrared safety and longitudinal 
boost invariance). 

3.2 Areas for 2-particle configurations 

In this section we study the same problem described in section [2"T2| i.e. the area of jets containing 
two particles, a hard one and a softer (but still "perturbative") one, i.e. eq. (pE]), but now for active 
areas. As before, the results will then serve as an input in understanding the dependence of the 
active area on the jet's transverse momentum when accounting for perturbative radiation. 

3.2.1 k t and Cambridge/ Aachen 

As was the case for the active area of a jet containing a single hard particle, we again have to resort 
to numerical analyses to study that of jets with two energy-ordered particles. We define ^4ja,_r(Ai2) 
to be the active area for the energy-ordered two particle configuration already discussed in section 

Additionally since we have a distribution of areas for the single-particle active area case, it 
becomes of interest to study also £ja,r(Ai2) the standard deviation of the distribution of areas 
obtained for the two-particle configuration. 
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Figure 8: (a) Active areas (divided by irR 2 ) for the three jet algorithms as a function of the 
separation between a hard and a softer particle. For comparison we also include the passive areas, 
previously shown in fig. [2j (b) The corresponding standard deviations. 



The results are shown in figure _2 the active areas can be seen to be consistently smaller than 
the passive ones, as was the case for the 1-particle area, but retain the same dependence on the 
angular separation between the two particles. Among the various features, one can also observe 
that the active area does not quite reach the single-particle value (~ 0.8l7ri? 2 ) at A12 = 2R, but 
only somewhat beyond 2R. This contrasts with the behaviour of the passive area. The figure also 
shows results for the cone area, discussed in the following subsection. 

3.2.2 SISCone 

In the case of the SISCone algorithm it is possible to find an anal yti cal result for the two-particle 
active area, in an extension of what was done for one-particle easel 16 ! 

The stable-cone search will find one or two "hard" stable cones: the first centred on the hard 
particle and the second centred on the soft one, present only for A12 > R. The pure-ghost stable 
cones will be centred at all positions distant by more than R from both p\ and P2, i-e. outside the 
two circles centred on p\ and p2- 

16 This is only possible for configurations with strong energy ordering between all particles — as soon as 2 or more 
particles have commensurate transverse momenta then the cone's split-merge procedure will include 'merge' steps, 
whose effects on the active area are currently beyond analytical understanding. 
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d<R 



R<d<V2R 



V2 R < d < 2R 



Figure 9: Picture of the active jet area for 2-particle configurations in the case of the SIS Cone jet 
algorithm. The black points represent the hard (big dot) and soft (small dot) particles, the black 
circle is the hard stable cone. The final hard jet is represented by the shaded area. The left (a) 
(centre (b), right (c)) plot corresponds to A12 < R (R < A12 < V2R, \/2R < A i2 < 2R). 

We shall consider the active area of the jet centred on the hard particle p±. When A12 > R, the 
jet centred on the soft particle has the same area. 

As in section [3.1.21 the split-merge procedure first deals with the pure-ghost protojets overlap- 
ping with the hard stable cone. For / > / max , this again only leads to splittings. Depending of the 
value of A12, different situations are found as shown on figure [9j If the y-(f> coordinates of the parti- 
cles are p\ = (A,0) and p 2 = (0,0), the geometrical objects that are present are: the circle centred 
on pi with radius R/2; the tangents to this circle at y-<p coordinates (A12/4, ±| \/R 2 — (A12/2) 2 ); 
and, for A12 < R, the ellipse of eccentricity Au/R whose foci are p\ and P2 (given by the equation, 
A i + A a 2 = R, where a is a point on the ellipse). For Ai 2 < R the area of the jet is given by 
the union of the ellipse, the circle and the regions between the ellipse, the circle and each of the 
tangents (fig. EJa,). For R < A i2 < \[2R it is given by the circle plus the region between the circle, 
the two tangents and the line equidistant from p\ and P2 (fig. ©))• For \^2R < A12 < 2R it is given 
by the circle plus the region between the circle and two tangents, up to the intersection of the two 
tangents (fig. Eh). Finally, for A > 2R, the area is ttR 2 /A. 

An analytic computation of the active area gives 



with x = A12/-R, while for Ai 2 > 2R, one recovers the result irR 2 /4. The SISCone active area 
is plotted in figure [10] and it is compared to the results for kt and Cambridge/ Aachen in figure EJ 
One notes that the SISCone result is both qualitatively and quantitatively much further from the 
passive result than was the case for kt and Cambridge/ Aachen. 

The main reason why the 2-point active area is larger than the 1-point active area (whereas we 
saw the opposite behaviour for the passive areas) is that the presence of the 2-particle configuration 
causes a number of the pure-ghost cones that were present in the 1-particle case to now contain the 
second particle and therefore be unstable. Since these pure ghost cones are responsible for reducing 
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Figure 10: Active area of the hardest jet as a function of the distance between the hard and soft 
particle for the SISCone algorithm, cf. eq. (j38j) . 



the jet area relative to the passive result (during the split-merge step), their absence causes the 
active area to be 'less-reduced' than in the 1-particle case. 



3.3 Area scaling violation 

We can write the order a s contribution to the average active area in a manner similar to the passive 
area case, eq. (fT3|) : 

(A 3AtR ) = A JA ,ij(0) + (AA 3A , R ) , (39) 
where we have used the relation ^4jA,R(one-particle-jet) = Aj AtR (0) and where 

dP 



Pti 

dA 12 I dp t2 - — r— 

o/Ai 2 d Pt2 dA 12 



(A JA>R (A l2 ) - A JA ,fl(0)) . 



(40) 



Note that compared to eq. (|13p we have removed the explicit upper limit at 2R on the A 12 integral 
since, for active areas of sequential recombination algorithms, (Aj A ^(Ai 2 ) — Aj A ^ R (0)) may be non 
zero even for A12 > 2R. Note also the notation for averages: we use (• • •) to refer to an average 
over perturbative emissions, while (• • •) , implicitly contained in Aja,_r (see eq. (j28|) ). refers to an 
average over ghost ensembles. We now proceed as in section 12.31 and write 



C 

(AA 3A)R ) ~ Dj A)R —^- In 



a s (Qo) 



7t6 a s (Rpti) 



f df) 

Dj A , R = y o — (A JAjR (e) - a JAjR (o)) 



(41) 



where for brevity we have given just the running-coupling result. One observes that the result 
continues to depend on Qq, an indication that the active area is an infrared- unsafe quantity, just 
like the passive area0 The coefficients for the anomalous dimension of the active area for the 
various algorithms are 



D kuR ~0.52irR 2 , 

D Ca *n, R ^ 0. 08 7T R 2 , 
^SISCone.fl^O.1246^ 2 



(42a) 
(42b) 
(42c) 



17 We believe that any sensible (i.e. related to the jet's sensitivity to UE/pileup type contamination) definition of 
area would actually be infrared unsafe. 
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where the SISCone result has been obtained by integrating the analytical result, eq. (|38p . while the 
results for kt and Cambridge/Aachen have been obtained both by integrating the 2-point active- 
area results shown in fig. [8] and by a direct Monte Carlo evaluation of eq. (|4ip . Note that while the 
coefficients for kt and Cambridge/ Aachen are only slightly different from their passive counterparts, 
the one for SISCone has the opposite sign relative to the passive one. 

The treatment of higher-order fluctuations of active areas is more complex than that for the 
passive ones, where the one-particle area was a constant. We can separate the fluctuations of active 
areas into two components, one (described above) that is the just the one-particle result and the 
other, ( AS j A R ) , accounting for their modification in the presence of perturbative radiation: 

(£2 A>Jl ) = S5 A)H (0) + (A^ R ) , (43) 

where Sja,_r(0) is given by eqs. (f32h .b) for the kt and the Cambridge/ Aachen algorithms respec- 
tively, and it is equal to zero for the SISCone algorithm. The perturbative modification (A£j A R ) 
is itself now driven by two mechanisms: the fact that the second particle causes the average area to 
change, and that it also causes modifications of the fluctuations associated with the sampling over 
many ghost sets. We therefore write 

(AS^-^S-ln^,, (44) 
irb a s (Rpti) 

Sk* = J -j [(Aja,r(0) - AjaM®)) 2 + &ja,r( 9 ) ~ S JA,fi(0)] (45) 
f dQ 

= J Q -j{AaA°) - 4Lv,k(0)) - 2A JA , R (0)Dj A , R , (46) 

where as usual we neglect contributions that are not enhanced by any logarithm or that are higher- 
order in a s . The details of how to obtain these results are given in Appendix [Cj 

The coefficient S 2 AR can be determined only numerically for the kt and Cambridge/ Aachen 
algorithms, while for the SISCone algorithm the result can be deduced from eq. (138j) together with 
the knowledge that SsiSCone,fl(0) = 0: 

S 2 kt)R ^(0AlnR 2 )\ (47a) 

Scam, R ^(0.19^R 2 ) 2 , (47b) 

^SISCone.R^ (0-0738 ^ 2 ) 2 . (47c) 

Again, both the values and their ordering are similar to what we have obtained for the passive areas 
(see eq. ([22]) ). 

3.4 n-particle properties and large-n behaviour 
3.4.1 k t algorithm 

As for the passive area, the kt algorithm's active area has the property that it can be expressed as 
a sum of individual particle areas: 

A kt>R (J) = E A **M ■ ( 48 ) 

Pl eJ 

This is the case because the presence of the momentum scale kt in the distance measure means that 
all ghosts cluster among themselves and with the hard particles, before any of the hard particles 
start clustering between themselves. However in contrast to the passive-area situation, there is no 
known simple geometrical construction for the individual particle area. 
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3.4.2 Equivalence of all areas for large n 

The existence of different area definitions is linked to the ambiguity in assigning 'empty space' 
to any particular jet. In the presence of a sufficiently large number of particles n, one expects 
this ambiguity to vanish because real particles fill up more and more of the empty space and thus 
help to sharpen the boundaries between jets. Thus in the limit of many particles, all sensible area 
definitions should give identical results for a given jet. 

To quantify this statement, we examine (a bound on) the scaling law for the relation between the 
density of particles and the magnitude of the potential differences between different area definitions. 
We consider the limit of 'dense' coverage, defined as follows: take square tiles of some size and use 
them to cover the rapidity-azimuth cylinder (up to some maximal rapidity) . Define A as the smallest 
value of tile edge-length such that all tiles contain at least one particle. In an event consisting of 
uniformly distributed particles, A is of the same order of magnitude as the typical interparticle 
distance. The event is considered to be dense if A <C R. 

Now let us define a boundary tile of a jet to be a tile that contains at least one particle of that 
jet and also contains a particle from another jet or has an adjacent tile containing one or more 
particle(s) belonging to a different jet. We expect that the difference between different jet-area 
definitions cannot be significantly larger than the total area of the boundary tiles for a jet. 

The number of boundary tiles for jets produced by a given jet algorithm (of radius R) may 
scale in a non-trivial manner with the inter-particle spacing A, since the boundary may well have a 
fractal structure. We therefore parametrise the average number of boundary tiles for a jet, N^j^r 
as 

(R\ 
a) ' (49) 

where the fractal dimension 1 ja = 1 would correspond to a smooth boundary. The total area of 
these boundary tiles gives an upper limit on the ambiguity of the jet area 

(|o JA ,fl - A 3A , R \) < N btJA>R A 2 ~ i^A A 2-n JA > (50) 

and similarly for the difference between active or passive and Voronoi areas. As long as~I ja < 2 the 
differences between various area definitions are guaranteed to vanish in the infinitely dense limit, 
A — > 0. We note that "1 ja = 2 corresponds to a situation in which the boundary itself behaves like 
an area, i.e. occupies the same order of magnitude of space as the jet itself. This would be visible 
in plots representing jet active areas (such as fig. 0|), in the form of finely intertwined jets. We 
have seen no evidence for this and therefore believe that 1 < ~l ja < 2 for all three jet algorithms 
considered here. 

In practice we expect the difference between any two area definitions to vanish much more 
rapidly than eq. (f50|) as A — ► 0, since the upper bound will only be saturated if, for every tile, the 
difference between two area definitions has the same sign. This seems highly unlikely. If instead 
differences in the area for each tile are uncorrelated (but each of order A 2 ) then one would expect 
to see 

<|o JA ,fl - A JAiR \) ~ v^a^A 2 ~ flW2 A 2-n JA /2 _ (51) 

We have measured the fractal dimension for the kt and Cambridge/ Aachen algorithms and find 
"Ifct — ~Icam — 1.20 — 1.25@ Note that any measurement of the fractal dimension of jet algorithms 

18 This has been measured on pure ghost jets, because their higher multiplicity facilitates the extraction of a reliable 
result, however we strongly suspect that it holds also for single-particle jets. 
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in real data would be severely complicated by additional structure added to jets by QCD branching, 
itself also approximately fractal in nature. 

The fact that active and passive (or Voronoi) areas all give the same result in dense events has 
practical applications in real-life situations where an event is populated by a very large number of 
particles (heavy ion collisions being an example). In this case it will be possible to choose the area 
type which is fastest to compute (for instance the Voronoi area) and use the results in place of the 
active or passive one. 

4 Back reaction 

So far we have considered how a set of infinitely soft ghosts clusters with a hard jet, examining also 
cases where the jet has some finitely-soft substructure. This infinitely-soft approximation for the 
ghosts is not only adequate, but also necessary from the point of view of properly defining jet areas. 
However if we are to understand the impact of minimum-bias (MB) and underlying-event radiation 
on jet finding — the original motivation for studying areas — then we should take into account the 
fact that these contributions provide a dense background of particles at some small but finite soft 
scale ~ Aqcd- 

This has two main consequences. The first (more trivial) one is that the minimum-bias or un- 
derlying event can provide an alternative, dynamic infrared cutoff in the pt2 integration in equations 
such as eq. (JT3j) : assuming that the density of MB transverse momentum per unit area is given by 
p then one can expect that when pt2 <C irR 2 p, the presence of P2 will no longer affect the clustering 
of the ghosts (i.e. the MB particles). In this case the pt2 integral will then acquire an effective 
infrared cutoff ~ irR 2 p and in expressions such as eqs. (115|16f) . Qq will be replaced by irR 3 p. Note 
that neither with infinitely nor finitely-soft ghosts do we claim control over the coefficient in front 
of this cutoff, though we do have confidence in the prediction of its R dependence for small R,\^ 

The second consequence of the finite softness of the MB contribution is that the addition of 
the MB particles can modify the set of non-MB particles that make it into a jet. We call this 
'back-reaction' and it is the main subject of this section. 

That back reaction should happen is quite intuitive as concerns non-MB particles whose softness 
is commensurate with the MB scale. However the extent to which it occurs depends significantly 
on the jet algorithm. Furthermore for some algorithms it can also occur (rarely) even for non-MB 
particles that are much harder than the MB scale. 

As with the studies of areas, there are two limits that can usefully be examined for back-reaction: 
the illustrative and mathematically simpler (but less physical) case in which the MB is pointlike 
and the more realistic case with diffuse MB radiation. 

4.1 Back reaction from pointlike minimum-bias 

Let us first calculate back reaction in the case of pointlike minimum bias. We will consider minimum- 
bias particles with transverse momentum pt m distributed uniformly on the y-cfi cylinder with density 
v m <C 1. We use a subscript m rather than g to differentiate them from ghost particles, the key 
distinction being that pt m is small but finite, where pt g is infinitesimal. 

Let us consider the situation in which a particle p\, with large transverse momentum, pt\ 3> pt m , 
has emitted a soft particle p2 on a scale commensurate with the minimum-bias particles, pt2 ~ Ptm- 

19 The coefficient can actually be determined quite straightforwardly, however since its impact is of the same order 
as other effects that we neglect (i.e. free of any logarithmic enhancements) we leave its determination to future work. 
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We shall calculate the probability that p 2 was part of the jet in the absence of the minimum-bias 
particle, but is lost from it when the minimum-bias particle is added. This can be written 

dP ( £ R r.. . r,. dp 



dp, 



12 



dy 

\ dA\2~: j-r Hja,r(P2 G J\) HjA,r(P2 ^ J\\Pm) , (52) 

J J dp t2 dA 12 



where Hja,r(P2 G J\) is 1 (0) if, in the absence of p m , p 2 is inside (outside) the jet that contains 
p\. Similarly, Hja,r(P2 ^ Ji\Pm) is 1 (0) if, in the presence of p m , p 2 is inside (outside) the jet that 
contains p\. One can also define the probability for p 2 to not be part of the jet in the absence of 
the minimum-bias particle, but to be gained by the jet when the minimum-bias particle is added, 

dP JAR f f dP 

— — '— = / d(f) m dy m v m I ciAi 2 - jt — Hja,r(pz $■ Ji)Hja,r(P2 g Ji\Pm) ■ (53) 

dPt2 J J 1p t 2 a^-12 

It is convenient to factor out the particle production probability as follows, in the small R limit, 



dP { £ R . dP 

^12" 



dpt2 dp t2 dAu 



Ai2=R 



U rn bftl^Ptz/Ptm) , (54) 



where b^ R (pt2 / Ptm) can be thought of as the effective 'back-reaction area' over which the minimum- 
bias par ticle causes a loss of jet contents, given a ciAi 2 /Ai 2 angular distribution for the jet con- 



tents 
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d(f) m dy m J —^—Hja,r(P2 £ Ji) Hja,r(P2 ^ J\\Pm) ■ (55) 
One can similarly define an effective back-reaction area for gain, 



dP\t ] R a dP 

^12" 



Ai2=R 



v ™ bjA yR (pt2/Ptm) , (56) 



dpt2 dp t 2 dAi2 

//* 
d(j} m dy m J -^-^-Hja,r(P2 i J i) h JA,r(P2 g J l\Pm) ■ (57) 

For sequential-recombination algorithms the H functions in eqs. (|55l57p translate to a series of 
©-functions, e.g. 

Hk t ,R(P-2 G J\)Hk u R{P2 i J\\Pm) = e(R- Ai 2 )0(A 1(2+m) -R)x 

x 0(A im - min(l,p t2 /pt m )A 2m )6(Ai2 - min(l,pt m /pt 2 )A 2m )0(i? - A 2m ) , (58) 

for the hi algorithm and 

H C &m,R(p2 e Jl)H C&m ,R{p2 i Jl\Pm) = &{R ~ Al 2 )G(Ai( 2+m ) - R)x 

x 8(A lm - A 2m )0(A 12 - A 2m )0(i? - A 2m ) , (59) 

for Cambridge/ Aachen, where A 1( - 2+m ) is the distance between p\ and the recombined p 2 + p m . 
Evaluating integrals with the above ©-functions is rather tedious, but one can usefully consider 
the limit pt m <C pt2 "C pn- This of physical interest because it relates to the probability that the 



1 Strictly speaking it is the integral over area of the probability of causing a loss of jet contents. 
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Figure 11: The effective area for back-reaction as a function of the ratio of the soft perturbative 
scale pt2 and the point-like minimum-bias scale ptm, showing separately the loss (a) and gain (b) 
components for four jet definitions. 



minimum-bias particle induces changes in jet momentum that are much larger than pt m , and a 
number of simplifications occur in this limit. Since 



*l(2+m) 



? Ptm 7 

^12 H A>2rn 

Pt2 



Al2 + 



Ptm A i2 • A 



2 m 



Pt2 



V12 



P?2 



+ O ( ) . 



(60) 



P2 must be close to the edge of the jet in order for it to be pulled out by p m , |Ai2 — R\ <C 1. Without 
loss of generality, we can set y\ = 4>\ = 4>2 = 0, so that A12 = 2/2 — R, and 



A 



l(2+m) 



y2 + P —{ym-R) + 0[^R). 



Pt2 



Ptr 



Pt2 



(61) 



We can then carry out the integrations over (f)-jn and y2 straightforwardly, leading to following the 
result for loss at high p t 2 , 



In a similar manner one obtains for the gain. 



2R 



dy r , 



R 



(y m - R) ptm 

R Pt2 



2 Pt 



2^R^-(y m -Ry = ^R 2 
3 Pt2 
(62) 



G(K2/Pfm > 1) ^ b y c'(pt2/ptm » 1) 



R 



dy„ 



R/2 



(R - y m ) ptm 

R Pt2 



2y/R?-(R-y m )* = 
\ 3 4 / p t2 



The results for general pt2/ptm, determined numerically, are shown in figure [TT1 The SISCone 
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results are included and have the property that , 

4lSCone,ii(Wftm) =0 f °r ZT~ > T~f ' ^ 

Ptm 1 J 

b< SsCone, R (Pt2/Ptm) =0 for M. > LJ- . (64b) 

i.e. at high pt2> point-like minimum bias never induces back-reaction in the cone algorithm!^! in 
contrast to the situation with the sequential recombination algorithms, for which back-reaction 
occurs with a suppressed, but non-zero probability ~ pt m /pt2- On the other hand, for pt2 ~ ptm 
back-reaction is more likely with the cone algorithm — the effective area over which the MB 
particle can cause a change in jet contents is ~ 0.57ri? 2 , to be compared to ~ O.IttR 2 for the kt and 
Cambridge/Aachen algorithms. 

One may use the results eqs. (|62p - (|64p to determine the average change in jet-momentum due 
to back reaction. Because of the logarithmic spectrum of emissions dP/{dpt2 d/S.^), one finds that 
it receives contributions from the whole logarithmic region pt m < Pt2 < Pti > 



(G-L) f Ptl 
( A K,JA,k) - / d Pt2Pt2 

Jptm 



dP (G) dp {L) 
dp t 2 dp t 2 



Ci a s (p tm R) 

/3jA,RP-^-ln — — , (65) 

7r&o a s (ptiR) 



(evaluated for fixed coupling), where p = v m Ptm corresponds to the average transverse momentum 
of minimum-bias radiation per unit area and 

0JA,R = hm — \ bf2 R {Pt2/Ptm) ~ ^jAijfeMm)) • (66) 

The structure of the correction in eq. (|65p is very similar to that for the actual contamination from 
minimum bias, p( AajA,j?) , with ( Aoja,_r) as determined in section 12. 3t notably, for fixed coupling, 
the average back-reaction scales with the logarithm of the jet pt- The coefficients f3jA,R, 

Pk t ,R = /?Cam,R = ~^R 2 oi -0.1378vri? 2 , (67a) 

/?SISCone,iJ = . (67b) 

can be directly compared to the results for the djA,R there. The values are relatively small, similar 
in particular to what one observes for the Cambridge/Aachen algorithm, though of opposite sign. 

Though the average change in jet momentum, both from scaling violations of the area and from 
back-reaction, have a similar analytical structure, it is worth bearing in mind that these similar 
analytical structures come about quite differently in the two cases. Regarding area scaling violations, 
a significant fraction of jets, ~ a s Inpti/ptm, are subject to a change in area ~ R 2 (cf. section l2~3l) . 
and a consequent modification of the minimum-bias contamination by a modest amount ~ ptm- in 
contrast the average back reaction effect ~ a s pt m ^Pti/ptm, is due to large modifications of the 
jet momentum ~ pt2 with pt m <€i pt2 *C Pti occurring rarely, with a differential probability that 
depends on p t 2 as ~ a s dp t 2Ptm/p 2 2- O ne consequence of this is that the mean square change in 
transverse momentum due to back-reaction is dominated by very rare modifications of jets in which 
Pt2 ~ Pti, giving 



JA R ) ) ~ UsPtWtmVm ■ (68) 



J t,JA,R 



21 For large but finite p t i, back-reaction can actually occur beyond the above limits, but only with probability 

' Ptm / Ptl- 
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Note that the coefficient of this dispersion is non-zero even for the SISCone algorithm, due to 
the residual probability ~ Ptm/pti that it has, for finite pti, for any change in structure with 

Ptm < Pt2 < Ptl ■ 

4.2 Back reaction from diffuse MB 

Let us now examine back reaction in the case where the minimum-bias radiation has a uniform 
diffuse structure, consisting of a high density of minimum-bias particles, v m S> 1. The relation 
between back reaction in the point-like and diffuse cases is rather similar to the relation between 
passive and active areas: key analytical features remain unaffected by the change in the structure 
of the minimum bias, but certain specific coefficients change. 

We define the probability for loss in the presence of diffuse MB as 

dP^R I f dP \ 

-P 5 = lim ( / dA l2 H JA , R (p 2 G Ji) H iKR (p 2 <£Ji\p)) , (69) 

dp t 2 v m -+oc \J dp t 2dA 12 / MB 

P=v m (ptm) fixed 

where the average is performed over the ensemble of MB configurations, and now Hja,r(p2 $ Ji \ p) 
is 1 (0) if p 2 is outside (inside) the jet containing p% in the presence of the specific minimum-bias 
instance. A similar equation holds for the gain probability. 
Then, as with eq. (|54p we factorise this, 



dP$Jl a dP 



dp t 2 dp t2 dA 



12 



b$Mp) . ( 7 °) 



into one piece related to the probability for perturbative emission and a second piece B^ R that is 
the diffuse analogue of the effective back-reaction area in the point-like case. Note however 

that it is not so obvious exactly what geometrical area B^ R actually corresponds to@ Similarly, 

(G) (G) 

we introduce dPj A R /dpt 2 and Bj^ R corresponding to the gain in the presence of a diffuse MB. 

(L G) 

The only case for which we have analytical results for -Bj A ' R (pt 2 / p) is the SISCone algorithm, 
for which (with / > / max ~ 0.391) 

4Scone>*2/p) = ^SCone,* Wp) = • (71) 

This is a consequence of the facts (a) that the addition of a uniform background of MB particles 
has no effect on the stability (or instability) of a specific cone, (b) that for pt 2 <C pti the split-merge 
step is immaterial to the jet finding if p 2 is within the cone around p%, and (c) that for p 2 outside 
the cone around pi, the maximal possible overlap is of p 2 s stable cone with that of p\ is / max arid 
if / > /max then the two cones will always be split, ensuring that p 2 remains in a jet distinct from 
p\. We believe that real-life corrections to the zero in eq. (|7ip are proportional to the standard 

deviation of MB transverse- momentum density from point to point in the event. 

(L G) 

Numerical results for the -Bj A ' R (pt 2 / p) for the kt and Cambridge / Aachen algorithms are given in 
in figure H2] and compared to the results in the point-like case. One sees that the general functional 



22 A related issue is that the precise choice of normalisation of Sj^ a ls somewhat arbitrary — our specific choice 
is intended to provide a meaningful connection with b) j I ^ R in the large pt2 limit. 
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Figure 12: Numerical results for the diffuse effective back-reaction 'area', Bj^J(p t 2/p), for the kt 

(L G) 

and Cambridge/ Aachen algorithms, with the point-like results 6j A ' 1 ypti/Vtm) shown for comparison 
also. Results obtained for R = 1 and verified also for R = 0.7. 



form is rather similar though the normalisations are somewhat smaller (by a factor of 2 for loss, a 
factor ~ 10 for gain). The asymptotic large-pt2 behaviours are observed to be 



<iowp) - b, 



(L) 
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im/p) 
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Pt2 
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Pt2 
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As in the point-like case we can calculate the mean change in jet transverse momentum due to back 
reaction and we obtain 



pa 



dpt2Pt2 



Vtm 



dP 



(G) 



JA,R 



dP 



JA,R 



dp 



with 



B.1A. 



R 



y Pt2 

lim — 

Pt2^O0 p 



12 



D 



dp 



12 



d a s (pR 3 ) 
i3ja,r p - — In 



7t6 a s {pt\R) 



(G) 



(73) 



(74) 



Even though b^f^ had pta/ptm as its argument and B^ G ^ has Pis/p, the final expressions for 
the average back-reaction in the point-like and diffuse cases, eqs. ([55]) . (|75j) . have almost identical 
forms — in particular the overall scale appearing in each is p and the only difference appears in the 
denominator for the argument of the logarithm. The coefficients are slightly smaller, 

Bn* m r ^ -O.WnR 2 , 



&k t ,R 



Cam,/? 
£>SISCone,R 



0, 



(75a) 
(75b) 



and will again translate to modest effects compared to the overall minimum-bias contamination in 
the jets. Note also however that in general the scaling with R of b^'jp (pt2 /ptm) and B^^ (p t 2 / p) is 
subtly different. The former truly behaves like an area, in that # (ptz/Ptm) I B 2 is -R-independent; 



the latter instead has the property that it is Bf. L '^\R 2 p t 2/ p) that is R- independent. 
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5 Areas in (simulated) real life 



In this section we shall examine the properties of jet areas in the context of realistic events, as 
simulated with Herwig [T7j and Pythia [18] . There are two purposes to doing so. Firstly we 
wish to illustrate the extent to which the simple single-gluon emission arguments of the previous 
section hold once one includes full parton-showering and hadronisation. Secondly, jet areas have 
the potential to play an important role in the estimation and subtraction of underlying event and 
pileup contamination, as discussed in [19]. The study of jet areas in realistic events can help to 
highlight some of the issues that arise in such a procedure. 

5.1 Jet area distributions and anomalous dimension 

Let us start with an investigation of the distribution of the areas of hard jets with p$ > 1 TeV in 
simulated LHC dijet events, fig. [TBI The area distributions are shown for various 'levels' in the 
Monte Carlo: just after parton showering, after hadronisation, both with and without an UE, and 
finally with a substantial pileup contribution (high-luminosity LHC, ~ 25 pp interactions per bunch 
crossing). We examine both passive and active areas. 

The passive areas distributions at parton-shower level, fig. 113b. are those that are most amenable 
to understanding in terms of our analytical results. Firstly one notes that the SISCone and Cam- 
bridge/Aachen algorithms have a clear peak at a = ttR 2 . These two algorithms both have the 
property (cf. section I2.2[) that the area is not affected by moderately collinear (A12 < R for SIS- 
Cone, A12 < R/2 for Cambridge/Aachen) soft particle emission. Thus it is possible, even in the 
presence of parton showering (which is mostly collinear), for the passive area to remain ttR 2 . For 
the cone algorithm, the other main structure is a "shoulder" region 0.8 < a/(nR 2 ) < 1, which 
coincides nicely with the range of values that can be induced by 2-particle configurations (cf . fig. [2]) . 
A similar shoulder exists for the Cambridge algorithm, which however additionally has jets with 
a > ttR 2 — again the range of values spanned, up to a ~ 1.6ttR 2 , is rather similar to what would be 
expected from the two-particle calculations. Further broadening of these distributions at the edges 
is attributable to parton- level states with more than two particles. In contrast the parton- level 
passive area distribution for the kt algorithm seems less directly related to the 2-particle calcula- 
tions of section I2.2L This can be understood by observing that the kt passive area is modified even 
by rather collinear emissions, and the multiple collinear emissions present in parton showers add 
together to cause substantial broadening of the area distribution. 

At parton shower level, there are relatively few particles in the event and there is no obvious 
boundary to the jet — the ghosts that we add provide a way of assigning that empty area. It is 
therefore not surprising to see significant differences between the two ways of adding ghosts, i.e. the 
passive and active area distributions. This is most marked for SISCone, as is to be expected from 
the results of section 13.1.21 which showed that for a 1-particle jet the active area is 7ri? 2 /4. There 
is a trace of this result in fig. [13b, where one sees that the SISCone distribution now extends down 
to A = 7ri? 2 /4. There is however no peak structure, presumably because even a highly collinear 
emission gives a slight modification of the area, cf. fig. [10] (the same argument given for the absence 
of a peak for the passive kt area). For the Cambridge/ Aachen and kt algorithms, there is less 
difference between active and passive area distributions, again as expected. 

As one moves to events with more particles, for example hadron level, fig. 113b and d, the particles 
themselves start to give a clearer outline to the jets. Thus the passive and active distributions are 
far more similar. This is less so for SISCone than the others, as is to be expected, for which one 
still sees a trace of the peak at a = ttR 2 . 
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Figure 13: Distribution of active and passive areas of the two hardest jets in a range of simulated 
LHC dijet events, with a minimum pt of 1 TeV in the Pythia [18] event generation (version 6.4, 
default tune). Only jets with \y\ < 2 have been included in the histogram; 'parton' indicates parton- 
level, 'hadron' indicates hadron-level with the UE switched off, 'UE' corresponds to hadron-level 
with UE switched on and in the 'pileup' case the UE-level event is supplemented with additional 
minimum-bias events corresponding to 0.25 mb _1 per bunch crossing (about 25 simultaneous inter- 
actions). For all jet algorithms we use R = 1. A ghost area ~ 0.02 was used throughout except 
for SISCone, where the ghost area was roughly 0.1 in the active area cases. Note that "area" in 
these plots corresponds to ajA,R,(Ji) and AjA._R(Jj) respectively for the passive and active areas. 
As such the latter has been averaged over ghost ensembles (see eq. (|28p ) and the dispersion is 
a consequence of the event and jet structure. We have found that 5 ghost ensembles were suffi- 
cient for kt and Cambridge/ Aachen algorithms, and 3 for SISCone (with pileup, 1 ensemble would 
actually have been enough). In contrast, the Cam/Aachen 1-particle jet result corresponds to 
irR 2 /NdN/dA{ 1-particle-j et | {g t } ) and serves to illustrate how the impact of variability in the event 
structure in real events has consequences rather similar to that of the variability of ghost-particle 
ensembles in the theoretical arguments of the previous sections. 
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For events with many particles, for example with the UE (fig. [T3"e ) and then pileup (fig. fT3T ) 
added, the difference between passive and active area distributions is so small that we show only 
the active area result. These last two plots are the most relevant to phenomenology. A feature 
of both is that the dispersion is smallest for SIS Cone (but significantly different from zero) and 
largest for k t , precisely as one would expect from eqs. (|43p and (|47p . Another feature is how for 
Cambridge/Aachen the dispersion is essentially that associated with S(0), the distribution being 
very similar to the 1-parton active area result, shown as the solid line. This similarity is strongest 
when there is pileup: the logarithmic enhancement that enters in eq. (|44p is reduced because the 
pileup introduces a large value for Qq ~ 20 GeV. In this case even the kt algorithm starts to have a 
distribution of areas that resembles the 1-parton active area result, and for SISCone one once again 
sees signs of the lower limit at A = irR 2 /4, the one-particle active area result. 

While the plots of fig. [13] provide an illustration of many of the features that we have discussed 
in the earlier sections of this paper, the fact that we have not systematically calculated area dis- 
tributions means that the discussion necessarily remains qualitative. For quantitative checks, one 
should instead examine the mean active area and its dispersion and compare them to the results of 
section [3j This is done in fig. [HI separately for qq — > qq and for gg — ► gg scattering, as a function 
of the jet transverse momentum. The horizontal scale has been chosen uniform in lnlnp^/A so that 
our predictions correspond to straight lines. 

The predictions of section [3] set the slope of those lines for each algorithm, and the agreement is 
reasonable in all cases. The infrared cutoff scale Qq is not predicted, and may differ both between 
quark and gluon jets and between algorithms. The values for Qq have therefore been fitted, using 
the results for the mean, and are consistent with a general non-perturbative origin (modulo issues 
in SISCone discussed below). The standard deviation (indicated by the band for the Monte Carlo 
simulation and the thin dashed lines for the theory result) is then entirely predicted, and also agrees 
remarkably well. Thus, overall, our simple analytical calculations provide a surprisingly successful 
picture of the mean and dispersions for various algorithms, over a range of jet transverse momenta. 
This is all the more striking considering that the calculation is based on just the first term in a 
series a™ln n pt, and is in part based on a small angle approximation. 

Some remarks are due regarding the Qq values. Two clear patterns emerge: it is largest for kt, 
smallest for SISCone, and systematically larger for gluon jets than for quark jets. In the case of 
SISCone with quark jets, the value is uncomfortably close to the value of Aqcd = 0.2 GeV used in 
the one-loop coupling. It may be that this low value is an artefact whose origin lies in the finite 
density both of actual event particles and of ghosts (the latter due to speed limitations in SISCone): 
when one has a limited density of particles and/or ghosts, the measured area may be intermediate 
between the passive and "ideal" active areas; because SISCone has such a large difference between 
passive and active areas (a factor of 4 for the 1-particle results), the finite density effects can be 
significant, and so Qq may be taking an extreme value in order to compensate for this. 

A final comment concerns the choice of event generator. Here we have shown results from Herwig 
(v. 6.5). Pythia with the original shower (the default choice in v. 6.4) is known to have difficulties 
reproducing anomalous dimensions associated with soft large angle radiation [261 127j . whereas the 
new shower [28j mostly resolves these issues |27| . Similar concerns are potentially relevant here 
too. We actually find that both Pythia showers give results similar to Herwig's (and ours) in all 
cases except the kt algorithm, for which both Pythia showers give a slope a factor of two smaller 
than expected. This suggests that an experimental measure of the pt dependence of jet areas might 
provide some non-trivial constraints on parton-showering dynamics. 
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Figure 14: The mean (solid line) and standard deviation (band) of the active area for each of the 
two hardest jets in qq — > qq and gg — * gg events simulated with Herwig 6.5, as a function of the 
jet transverse momentum. The theory curves (thick dashed for mean, thin dashed for standard 
deviation) correspond to eqs. ([39|) - ([^7]) with a 1-loop coupling (Aqcd = 0.2 GeV, nf = 5), and 
Qo fitted (for the mean values). The areas have been obtained with ghosts of size 0.02 for kt and 
Cambridge/Aachen, and 0.1 for SISCone. For all algorithms, R = 1. Note that the horizontal scale 
is uniform in lnlnj>t/A. Shown for pp collisions with y/s = 14TeV. 
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5.2 Jet areas and pileup subtraction 



One of the main uses of measured jet areas is in the subtraction of pileup [19]. As we have seen 
there are two contributions to the modification of a jet's pt in the presence of pileup: the pileup 
particles can end up in the jet, and assuming a roughly uniform distribution of pileup (as in the case 
in the limit of many simultaneous minimum-bias collisions), the resulting change in jet pt will be 
proportional to the jet's area, by definition. Of course the pileup is not exactly uniform in a given 
event and this must also be accounted for. The only other modification of the jet's pt comes from 
the back reaction. This gives us the following equation for the modification of a jet's transverse 
momentum in the presence of pileup, originally stated in [19] . 



where, as before, p is the mean amount of transverse momentum per unit area that has been added 
to the event by pileup; a measures the fluctuations of the pileup from point-to-point within the 
event (defined as the standard deviation of the distribution of pileup across many squares of area 
1); and Apf is the net change in transverse momentum due to back reaction. An important point 
here is that A is the area of the jet after the pileup has been added — the fact that the area is not 
an IR safe quantity means that it is not the same before and after pileup, and it is in the latter 
case that it correctly measures how much pileup will have entered the jet. 

The fluctuations in the jet pt due to the direct pileup contribution can be written 



where the area-related averages are those for events including the pileup {i.e. Qo ~ R 2 p in calcula- 
tions of anomalous dimensions), and we separately account both for point to point fluctuations of 
the pileup within an event (second term) , as well as fluctuations in the overall level of pileup from 
event to event (third term). 

The direct pileup contribution to a sample of jets is shown in fig. [15a corresponding to about 25 
pp interactions per bunch crossing (high-luminosity LHC), as simulated with Pythia tune A. One 
observes that SISCone has smaller average pileup contamination and smaller fluctuations in this 
contamination, which is consistent with our analytical calculations. More surprisingly, at least at 
first sight, kt has only slightly larger and broader pileup contamination than Cambridge/ Aachen, 
despite its larger area. However its area is generally larger only because of the anomalous dimension: 
since the logarithm that appears in the anomalous dimension with pileup, log pt/p + O (1), is not 
very large, the difference between kt and Cambridge/Aachen is essentially absent. This is visible 
also in fig. [T3f (the somewhat larger difference there is a consequence of the larger jet pt). 

The reader may have observed that we omitted the back-reaction term in eq. ([77]) . The tail 
of the distribution for Apf ~ p t: \ has the property dP/dApf ~ a s p/ {Apf) 2 } so the standard 
deviation is dominated by large Apf ~ pti, giving a contribution to to eq. (|77p of order a s ppt\. For 
large pti this is parametrically bigger than the direct pileup contributions, however it is dominated 
by extremely rare events (those in which a hard particle in the jet is just near the edge of the 
jet), a fraction of ~ a s pjpt\ of all events. Consequently it provides a poor estimate of the typical 
back-reaction. 

The actual distribution for the back reaction is shown in fig. 115b . and one sees that the change 
in momentum is nearly always limited to a few GeV, centred close to zero. It is slightly skewed 
towards loss, which is consistent with the larger loss than gain probabilities found in section [4"f^*l 

23 Actually, for the kt and Cambridge/ Aachen algorithms, one can verify that the high-Ap t tails of the distributions 
are consistent with the dP <L/G) /dAp t ~ B {L/G) Ci ^ (A P )2 expectation from section WI\ 




(76) 



(Ap 2 ) Pl] - (Ap t ) 2 PV ~ <E3 AtH ><p> a + (A JA , R ) (a 2 ) + (A^ R ) 2 ((p 2 ) - (p) 2 ) 



(77) 
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Figure 15: Both plots refer to events generated with Pythia 6.4 in which the two hardest jets 
have pt > 200 GeV and are both situated at \y\ < 2. Plot (a) shows the distribution of transverse 
momentum of the pileup particles that entered each of the jets; plot (b) shows the back reaction on 
the jet, i.e. the component of the net change in transverse momentum of the jet that is associated 
with reassignment of the non-pileup particles in the presence of pileup. The pileup was generated 
with Pythia 6.4. A jet in an event with pileup is matched to the jet in the event without pileup 
with which it overlaps most, the overlap being defined as the transverse component of the sum of 
the momenta of all constituents that are present in both jets (i.e. the method introduced in |10j). 



Actual pileup is not perfectly uniform and for this reason the back-reaction with SISCone is not 
exactly zero as would have been expected from section 14.21 for diffuse radiation. Nevertheless the 
gain in particular is smaller than for other algorithms, which happens to coincide with what we 
would expect from fig. [11] for loss and gain with pointlike pileup. 

Let us finally come to the question of subtraction of pileup. The equation proposed in [19] was 

pt; h) =Pt 3 -A 3P . (78) 

There we estimated p from the event itself and effectively subtracted both UE and pileup. Here let 
us estimate p purely from the pileup particles (with Monte Carlo simulations this is possible) and 
compare the difference between the pt added by the pileup and the pt removed by the our subtraction 
procedure. The distribution of this difference is shown in fig. [161 The two plots correspond to two 
different ways of measuring p (either directly from the scalar sum of pts of pileup particles, left, 
or with the median-based method of |19j . which is applicable also in data). In both cases the 
distributions are clearly centred on zero, indicating a successful subtraction of the pileup. 

Of the fluctuation terms in eq. (|77p . only the middle (^4ja,_r) (c 2 ) piece should remain after 
subtraction. This is consistent with the distributions being much narrower than those in fig. [T5"a . 
This is an important point as it means that, in addition to subtracting the average contamination 
due to pileup, we also reduce its effect on the pt fluctuations. For example, if one wants to look at 
the top mass spectrum, the corresponding peak will appear narrower (and with a better signal-to- 
background ratio) after subtraction than before. As a quantitative test, in fig. [TBI we show also a 
Gaussian of width (Aja,r) (& 2 }- Here a has been estimated in each event again either directly or 
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Figure 16: For the same events as in fig. [T5l the transverse momentum contamination from pileup, 
minus its estimate Ap with p estimated event-by-event and A obtained jet-by-jet (strictly for A we 
have used the transverse component of the 4- vector area). It is compared to a Gaussian distribution 
with standard deviation \J (cr 2 A) ~ */ {a 2 ) (A). In (a) p and a have been estimated by splitting 
the y-(j) cylinder into squares of area 1 up to \y\ = 3.5, calculating the sum of the scalar pt of the 
pileup particles in each of them; p and a are then given by the average and standard deviation of 
the contents of the squares. In (b) p and a have been evaluated by the procedure of [19], i.e. based 
on median and related percentiles of the distribution of pt/A for all jets up to \y\ = 3.5, carried out 
on the sample of jets obtained with the kt algorithm with R = 0.5. 
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with the methods of [19]. Whereas the p values obtained with the two methods are similar, those 
for a differ noticeably. This may in part be due to an occasional, hard pointlike contribution in 
the minimum-bias events, causing the fluctuations measured by a 2 to be non-Gaussian. This would 
explain why the statistically more standard evaluation of a shown in Fig. [T6a leads to slightly too 
broad a distribution. In contrast the evaluation of [19], in Fig. [T6b . is designed specifically to be 
insensitive to the pointlike component, on the grounds that one wants a measure not influenced 
by the main hard scatter. It provides a reasonable estimate of the lower half of the distribution, 
but undershoots the (asymmetric) upper tail, which one can probably ascribe to the rare pointlike 
component of the minimum bias collisions. 

6 Conclusions 

The concept of a jet area is one that initially seems rather intuitive. Yet, as we have seen in this 
article, there is a wealth of physics associated with it, both in terms of how one defines it and as 
concerns the interplay between a jet's internal structure and its area. This is reflected in the range 
of quantities that can be studied to characterize the behaviour of jet areas, summarised in table EJ 

Our guiding principle in defining jet areas has been that they should provide a measure of a 
jet's susceptibility to additional underlying event and pileup radiation, in the limit in which these 
are infinitely soft. Two opposite hypotheses for the structure of such radiation, pointlike or diffuse, 
lead to two definitions of the area, respectively passive or active, both calculated in practice with 
the help of infinitely soft "ghost" particles. The two definitions can be used with any infrared 
safe jet algorithm, and we have applied them both to sequential recombination algorithms {kt and 
Cambridge/Aachen) and to a stable-cone with split-merge algorithm, SISCone. 

The area of a jet may depend on its substructure. For the simplest case of a jet made of a single 
hard particle, the passive area coincides with one's naive expectation of ttR 2 for all jet algorithms 
considered here. The active area — that related to the more realistic, diffuse picture for UE and 
pileup — instead differs from nR 2 , the most surprising result being that for SISCone, whose active 
area is ttR 2 /4, a consequence of the split-merge dynamics (the widely used midpoint algorithm 
behaves similarly). Thus the widespread assumption that cone-based algorithms automatically 
have an area of ttR 2 is, in many cases, unjustified. 

Real jets of course consist of more than a single particle. The first level of substructure involves 
the addition of a soft particle to the neighbourhood of the jet. This modifies the jet area for all of 
the algorithms considered, again including SISCone, and we have seen that the average jet area then 
becomes an infrared unsafe quantity. Nevertheless it emerges that the effects of gluon emission can 
usefully be summarised in terms of an anomalous dimension, which encodes how the jet's average 
area depends on its transverse momentum. We have calculated this to leading order and seen that 
it agrees remarkably well with the pt dependence of measures of the jet areas in hadron-level Monte 
Carlo simulations. 

A summary of our main results for active and passive areas, their fluctuations and their anoma- 
lous dimensions is given in table [3j As is visible also in figure [bil for a broad range of pt, there is 
a hierarchy in the areas of the jet algorithms, (-4siSCone,i?) < (^Cam,fl) ^ ( A k t ,R)- A likely conse- 
quence is that SISCone jets will be the least affected by UE contamination, though the extent to 
which this holds depends on the precise balance between pointlike and diffuse components of the 
UE. The above hierarchy might also suggest an explanation for the opposite hierarchy in the size of 
hadronisation corrections, observed in Monte Carlo studies for the different jet algorithms in |10| . 
There the kt algorithm was seen to be least affected, which now appears natural, since a larger jet 
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quantity 


discussed in 


description 


Passive area 


Section [2] 


single-ghost area 


• average area 






o(lPJ) 


Sect. O 


passive area for a 1-particle jet 


(a) 


Eq. dH 


average passive area of jets with QCD branching: 
<a> = a(lPJ) + d&ln^ + . ; . 


d 


Eq. (HH) 


coefheient of leading scaling violations of (a) 


• died 11 UL* Lj LldljlUllo 






CT (,1-rJ) 




variance of one-particle passive area (0 by definition) 


\ a ) 


rijq. (|iy|) 


variance of passive area of jets with QCD branching: 
(0=^(1PJ) + S ^ln^ + ... 




rjq. [\zz\] 


coefficient of leading scaling violations of (cj ) 


Active area 


Section [3] 


many-ghost area (ghosts also cluster among themselves) 


• average area 






A(IP3) 


Eqs. (HH ES} 


active area for a 1-particle jet 


(A) 


Eq. ((39D 


average active area of jets with QCD branching: 
<A)=A(1PJ)+Z^ln^ + ... 


D 


Eq. g2D 


coefficient of leading scaling violations of (A) 


A(GJ) 


Eq. HMD 


average active area for pure-ghost jets 


• area fluctuations 






£ 2 (1PJ) 


Eqs. ([321 E7D 


variance of one-particle active area 


/V2\ 


-tjq. (|4oj) 


variance of active area of jets with QCD branching: 

( S ) = S ( 1PJ ) + S nb ln aI(Rp!) +■■■ 


s 2 


Eq. g7|) 


coefficient of leading scaling violations of (X 2 ) 


£ 2 (GJ) 


Eq. ([351) 


variance of active area for pure-ghost jets 


Back reaction 


Section d 


action of finite-momentum min-bias (MB) on the cluster- 
ing of the non-MB particles 


• pointlike MB 








Eqs. ([521 E3D 


probability of jet losing (L) or gaining (G) pt2 worth 
of non-MB particles 


b^ G \ P t2/ptm) 


Eqs. ([551 EZD 


eff. area for loss, gain (MB particle has _L mom. ptm) 


Q 

p 


-kq. Ijool) 


coeff. of net high-p+o gain— loss: limr,,„^ 00 ^-b^ G ~ L ^ (^-) 

Ptm K Vtra' 


• diffuse MB 






dp( L ' G )/d Pt2 


Eq. ([69D 


probability of jet losing (L) or gaining (G) pt2 worth 
of non-MB particles 


B^ G \m/p) 


Eq. cam 


eff. area for loss, gain (MB has 1-mom. density p) 


B 


Eq. ftSD 


coeff. of net high-j>t2 gain— loss: linip^^oo EQB( g ~ l \—) 



Table 2: A summary of the mathematical quantities defined throughout this article, together with 
descriptions of the associated physical concepts. 
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o(lPJ) A(1PJ) 


cr(lPJ) S(1PJ) 


d D 


s S 


A(GJ) S(GJ) 


kt 


1 0.81 


0.28 


0.56 0.52 


0.45 0.41 


0.55 0.17 


Cam/ Aachen 


1 0.81 


0.26 


0.08 0.08 


0.24 0.19 


0.55 0.18 


SISCone 


1 1/4 





-0.06 0.12 


0.09 0.07 





Table 3: A summary of the numerical results for the main quantities worked out in the article and 
described in table [2j All results are normalised to irR 2 , and rounded to the second decimal figure. 
Anomalous dimensions multiply powers of a™ In™ pt/Qo that for typical jet transverse momenta sum 
to something of order 1. Active-area and anomalous-dimension results hold only in the small- R 
limit, though finite-i? corrections are small. Back reaction is not easily summarised by a single 
number, and the reader is therefore referred to figs. [TT1 [I2j as well as to fig. [15] (right). 

is less likely to lose momentum through hadronisation. 

Jet areas, as well as being of interest from the point of view of understanding contamination 
from UE and pileup, are also the basis of recently developed methods for the jet-by-jet subtraction 
of this contamination [19]. The significant fluctuations of the area from one jet to the next mean 
that it is important to take into account the area of each individual jet, rather than assume some 
typical mean value. Among results here of relevance to the subtraction procedure, we highlight 
the demonstration that all areas (passive, active) are identical for highly populated events, which 
is important in ensuring that the subtraction procedure is free of significant ambiguities; we also 
remark on the calculation of back-reaction of pileup on jet structure, which is found to be a small 
effect. 

There are many avenues for potential further study of jet areas. The calculations presented here 
have extracted only the leading logarithmic part of the first non-trivial order in a s , and usually we 
have concentrated on properties of the mean area. The Monte Carlo results in section [5] also suggest 
interesting structures in the distributions of areas, and these merit further investigation. Another 
question for future work is that of the transition between passive and active areas if one considers 
a continuous transformation of the ghosts from pointlike to diffuse. Since real UE and pileup 
contamination is neither fully pointlike, nor fully diffuse, this question is of particular relevance for 
the stable-cone type algorithms, for which there is a large difference between the passive and active 
areas. 

Finally, an understanding of the behaviour of jet areas can play a key role in the choice of pa- 
rameters for jet algorithms, as well as in the design of new algorithms. One example of this concerns 
SISCone, for which we saw in section [3.1.21 that a split-merge overlap threshold / ~ 0.5 can lead to 
the formation of "monster jets," whereas a choice of / ~ 0.75, eliminates the problem, suggesting 
that the latter is a more appropriate default value. Another example is given in a companion paper 
|29j . where we show how to build an infrared and collinear safe jet algorithm, dubbed anti-fct, whose 
passive and active areas for single-particle jets are both irR 2 , and for which the area anomalous 
dimension is zero to all orders in a s . These are precisely the properties that one imagines for an 
ideal cone algorithm — but it is only once one has the tools for a quantitative discussion of jet 
areas that one may establish whether a given algorithm actually has these properties. 
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A Definitions of the three jet algorithms 

Throughout this paper, we have considered three jet algorithms: kt [H], Cambridge/ Aachen [15] 
and SISCone [IB], 

The first and the second of these are sequential recombination algorithms. They introduce a 
distance between each pair of particles and a beam distance diB for each particle. At each step, 
the smallest distance is computed. If it involves a pair of particles, they are recombined using a 
E'-scheme sum of four- momenta (i.e. direct addition of the four- momenta) , otherwise the particle 
is clustered with the beam and called a 'jet'. The definition of the distance for the kt algorithm is 

dij = min ( k 2 ti , fcj ) y%3 ~^ ^ , d iB = k 2 ti , (79) 



where Ayij = m — yj and A0y = — 4>j. For the Cambridge/ Aachen algorithm, the distance 
measures are 

Ay 2 - + AS 2 - 

dij = 3 n9 3 , d iB = 1 ■ (80) 



The SISCone jet algorithm is an infrared- and collinear-safe implementation of a modern cone 
algorithm. It first finds all stable cones of radius R, a stable cone being a circle in the (y, dj) plane 
such that the E'-scheme sum of the momenta inside the cone points in the same direction as the 
centre of the cone. It then runs a Tevatron run II type [22] split-merge procedure to deal with 
overlapping stable cones. The stable cones are ordered in p t , the scalar sum of the pt of a cone's 
constituents, to produce the initial list of protojets. One takes the hardest protojet and finds the 
next hardest one that overlaps with it (its scaled transverse momentum being ptj). If the shared pt 
is larger than fptj (/ is the split-merge overlap threshold parameter for the algorithm), they are 
merged, otherwise, the common particles are attributed to the protojet with the closer centre. If no 
overlaps are found, the protojet is added to the list of jets and removed from the list of protojets. 
This is repeated until no protojets remain. 

B Transition from one-particle jet to soft jet. 

Having observed the different properties of the area of pure ghost jets and jets containing a hard 
particle, one may wonder what happens to the area of a jet containing a "trigger" particle whose 
transverse momentum pt is progressively reduced, until it becomes much smaller (ultrasoft) than the 
momentum scale of a soft background, irR 2 p where, in the notation of section El p is the transverse 
momentum density per unit area of the soft background^ 

Figure [T7l shows, as one expects, that there are two asymptotic regions. At large pt, the area 
tends to that of a jet with an infinitely hard particle, eq. (|3ip . At small pt, it tends not to the pure 
ghost jet area, eq. (|34h . as one might naively expect, but rather to a value which can be predicted 
as 

, r \ fdAA 2 dN/dA . . 

^(ultrasoft-trigger-jet) = fdAAdN/dA , (81) 



24 Imagine the trigger particle being a B meson, which you could tag through a secondary vertex — you could then 
recognise its presence in any jet, regardless of its transverse momentum relative to the background. 
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Figure 17: Area of the jet containing a trigger particle when it is immersed in a bath of soft 
particles with transverse momentum momentum density p; shown as a function of the trigger 
particle transverse momentum. 



where dN/dA is the distribution of the number of pure ghost jets with a given area, and corresponds 
to the solid curves depicted in figure [5j This equation can be understood in the following way: when 
the momentum of the trigger particle becomes negligible compared to that of the soft background, 
it does not influence the size of its own jet. However, the likelihood of the trigger particle being 
found in a soft-background jet is proportional to that soft-background jet's area and one gets an 
extra factor of A in the integrand of the numerator of eq. (|8ip . One is able to use the pure-ghost-jet 
area distribution in eq. (f8~Tj) . because it coincides with that of soft background jets. 

Figure [T7] helps to illustrate the point that a diffuse background of particles such as pileup 
(represented here by the ghosts) provides an effective cutoff scale, below which a particle will not 
have any effect on the jet's area. Numerically, the effective cutoff does indeed coincide with the 
soft-transverse momentum scale irR 2 p. 



C Fluctuations of the active area 

In this appendix, we derive the results eqs. ([4"4" |) -(|46 p for the fluctuation coefficient Sj A R in the case 
of active areas. This is slightly more technical than for passive areas as we also have to deal with 
averages over ghosts distributions. Let us briefly recall our notation: (• • •) represents an average 
over perturbative emission, while (• • •) is an average over ghosts ensembles. When a quantity is 
evaluated for a specific ghost ensemble we will explicitly state so with notation of the form 
Aja,r{' ' ' I {fli})- From section [3] we have the implicit notation that a quantity specified without 
any mention of ghosts, such as ^4ja,r(0) is already averaged over ghost ensembles. 

At order a s , the mean active area can then be written (using ^4ja,_r(A = 0) = ^4jA,i?(one particle)) 

(A JA , R ) = ((A 3A , R (...\{ gi }))) g (82) 
Aj A> r{° I {^}) + I dP L4j A ,fl(A | {9i}) - A JA ,ij(0 | { gi })]j (83) 

- AjaA0) + D JA>R ^Ioe(^§^), (84) 
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with 



D JAiR = J ^ ((A JA , R (A | { 9l })) g - (A JA , R (0 I {9i})) 9 ). (85) 

Here, we have taken into account both the corrections due to the radiation of a soft particle and the 
average over the distribution of the ghosts. Note that we have used the shorthand dP to represent 
eq. (jlip . dP I \dpt2dA) , times dpt2dA, and that "• • • " in eq. (|82|) represents all possible perturbative 
states. 

For the corresponding fluctuations, the derivation goes along the same line 

(Z 2 A;R ) = ({A 2 AR (- ■ ■ I {gi}))) g - {Aja,r) 2 (86) 
'a 2 ja>r (0 I M) + J dP [A 2 AR (A I { 9l }) - A 2 A;R (0 I {g t })]) 

'a JAiR (0 I { 9l }) + JdP [A JAiR (A I { 9l }) - A JA:R (0 1 . (87) 



Neglecting the terms proportional to a 2 , we can write \ T, 2 A R ^ = £j A R (0) + ^A£j A where 

E 2 AjR (A) = (A 2 A>R (A I - (A 3A , R (A I , 

which for A = is the leading order result. We can also write 

Using straightforward algebra one obtains 

S 2 J A ,R = / X {^A,«( A I {ft}) " Aj AiR (0 I { 9l })]) g 

-2 (A 3A , R (0 I { 9l })) g ([A 3A , R (A I {<?,}) - A 3AjR (0 I { 9l })]) g } (90) 
^ <[^ A>i ,(A I { 9l }) - A 2 A R (0 I {5,})]), - 2 (A JAji? (0 I { ffi })) g D JA>R (91) 



<4j A ,*(A I { 9t }) - A JAtR (0 I { 9t })) 2 g + E;| AijR (A) - 5^(0) . (92) 



The second equality is a direct rewriting of the first. One gets to the last line by rearranging the 
different terms of the first one. The last two lines correspond exactly to eqs. ()46j) and (|45p of 
section I 
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